XQuake: an XML-based Knowledge Discovery Environment

نویسندگان

  • Andrea Romei
  • Franco Turini
  • Jean-François Boulicaut
  • Taneli Mielikäinen
  • Giorgio Ghelli
چکیده

Data mining is the analysis of large volumes of data to find unsuspected relationships and to summarize the data in novel ways, that are both understandable and useful to the data owner. Nowadays, the rapid growth of semi-structured sources raises the need of designing and implementing environments for data mining out of XML data. On the basis of the principles of the inductive database theory, this dissertation presents a flexible data mining system with capabilities of obtaining, maintaining, representing and querying induced, deduced and prior knowledge, stored inside native XML databases. In particular, it summarizes our three-years experience in the design and development of XQuake, a query language that extends XQuery to support mining primitives. Features of the language are an intuitive syntax, a good expressiveness, and the capability of dealing uniformly with data mining entities. A detail of its implementation and the evaluation of its performance are also given.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

XQuake as a Constraint-Based Mining Language

XQuake is a language for data mining inspired by the inductive databases theory. This work extends XQuake with the definition of domain-specific constraints. An ontology is used to describe the domain knowledge. We give the main idea of the work-inprogress discussing its possibilities and advantages.

متن کامل

Grid-Based Knowledge Discovery Services for High Throughput Informatics

Discovery Net is an application layer for providing gridbased knowledge discovery services. These services allow scientists to create and manage complex knowledge discovery workflows that integrate data and analysis routines provided as remote services. They also allow scientists to store, share and execute these workflows as well as publish them as new services. Discovery Net provides a higher...

متن کامل

Querying Compressed Knowledge Bases in Pervasive Computing

In the so-called Semantic Web of Things (SWoT), annotated information is tied/derived to/from micro-devices, such as RFID tags and wireless sensors, deployed in an environment. Compression techniques are so needed, due to the verbosity of semantic XML-based languages. Beyond compression ratio, query efficiency is a key aspect for knowledge discovery in mobile ad-hoc scenarios where resources ar...

متن کامل

An XML Framework Proposal for Knowledge Discovery in Databases

In recent years, the XML language has been receiving much interest among IT community. It has many nice properties that make it a great candidate for representation of different kinds of data. In this paper we will propose an XML framework for the domain of knowledge discovery in databases. This is not a specification document; this article tries to be a compendium of ideas and remarks concerni...

متن کامل

Data Mining and XML: Current and Future Issues

The paper describes potential synergies between data mining and XML, which include the representation of discovered data mining knowledge, knowledge discovery from XML documents, XML-based data preparation, and XML-based domain knowledge. Each category is viewed from a theoretical as well as a practical point of view.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009